15 research outputs found

    Adaptive Seeding for Gaussian Mixture Models

    We present new initialization methods for the expectation-maximization (EM) algorithm for multivariate Gaussian mixture models. Our methods are adaptations of the well-known k-means++ initialization and the Gonzalez algorithm. With them we aim to close the gap between simple random methods, e.g. uniform sampling, and complex methods that crucially depend on the right choice of hyperparameters. Our extensive experiments on artificial as well as real-world data sets indicate the usefulness of our methods compared to common techniques, e.g. those that apply the original k-means++ or the Gonzalez algorithm directly.
    Comment: This is a preprint of a paper accepted for publication in the Proceedings of the 20th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD) 2016. The final publication is available at link.springer.com (http://link.springer.com/chapter/10.1007/978-3-319-31750-2_24).
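    A minimal sketch of the seeding idea, assuming a k-means++-style sampler picks the initial means that EM is started from; the helper name kmeanspp_seeds, the synthetic data, and the use of scikit-learn's GaussianMixture are illustrative assumptions, not the paper's implementation.

```python
import numpy as np
from sklearn.mixture import GaussianMixture

def kmeanspp_seeds(X, k, rng):
    """Each new seed is drawn with probability proportional to its
    squared distance to the nearest already-chosen seed."""
    seeds = [X[rng.integers(len(X))]]
    for _ in range(k - 1):
        d2 = np.min([np.sum((X - s) ** 2, axis=1) for s in seeds], axis=0)
        seeds.append(X[rng.choice(len(X), p=d2 / d2.sum())])
    return np.array(seeds)

rng = np.random.default_rng(0)
X = np.vstack([rng.normal(m, 0.5, size=(100, 2)) for m in (0.0, 3.0, 6.0)])
gmm = GaussianMixture(n_components=3, means_init=kmeanspp_seeds(X, 3, rng)).fit(X)
print(gmm.means_)
```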

    Parameter Estimation of Gaussian Mixture Models Using a Hybrid Method Combining Self-Adaptive Differential Evolution with the EM Algorithm

    In the paper the problem of learning Gaussian mixture models (GMMs) is considered. A new approach, called DE-EM, based on the hybridization of a self-adaptive version of differential evolution (DE) with the classical EM algorithm is described. In this approach, the EM algorithm is run until convergence to fine-tune each solution obtained by the mutation and crossover operators of DE. To avoid problems with parameter representation and infeasible solutions, we use a method in which the covariance matrices are encoded using their Cholesky factorizations. In a simulation study, GMMs were used to cluster synthetic datasets differing in the degree of separation between clusters. The results of the experiments indicate that DE-EM outperforms the standard multiple-restart expectation-maximization algorithm (MREM). For datasets with a high number of features it also outperforms the state-of-the-art random swap EM (RSEM).
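    A minimal sketch of the DE-EM idea, assuming DE's mutation and crossover propose candidate component means and each candidate is fine-tuned by EM before greedy selection. For brevity only the means are evolved here (the paper also encodes covariances via their Cholesky factors, as sketched under the next entry); all names and constants are illustrative assumptions.

```python
import numpy as np
from sklearn.mixture import GaussianMixture

def de_em(X, k, pop_size=10, gens=20, F=0.7, CR=0.9, seed=0):
    rng = np.random.default_rng(seed)
    d = X.shape[1]

    def refine(genes):
        """EM fine-tuning: fit a GMM whose means start at the candidate's genes."""
        gmm = GaussianMixture(n_components=k, means_init=genes.reshape(k, d),
                              max_iter=200).fit(X)
        return gmm.means_.ravel(), gmm.score(X)  # refined genes, mean log-likelihood

    # initial population: k data points per individual, flattened to k*d genes
    pop = rng.choice(X, size=(pop_size, k)).reshape(pop_size, k * d)
    refined = [refine(ind) for ind in pop]
    pop = np.array([g for g, _ in refined])
    fit = np.array([f for _, f in refined])

    for _ in range(gens):
        for i in range(pop_size):
            others = [j for j in range(pop_size) if j != i]
            a, b, c = pop[rng.choice(others, 3, replace=False)]
            # DE/rand/1 mutation with binomial crossover, then EM refinement
            trial = np.where(rng.random(k * d) < CR, a + F * (b - c), pop[i])
            genes, ll = refine(trial)
            if ll > fit[i]:  # greedy one-to-one selection
                pop[i], fit[i] = genes, ll
    return pop[np.argmax(fit)].reshape(k, d)
```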

    Learning Finite Gaussian Mixture Models Using a Differential Evolution Algorithm

    In the paper the problem of parameter estimation of finite mixtures of multivariate Gaussian distributions is considered. A new approach based on the differential evolution (DE) algorithm is proposed. In order to avoid problems with the infeasibility of chromosomes, our version of DE uses a novel representation in which covariance matrices are encoded using their Cholesky decomposition. The numerical experiments involved three versions of DE differing in the method of selecting the strategy parameters. The results of the experiments, performed on two synthetic datasets and one real dataset, indicate that our method is able to correctly identify the parameters of the mixture model. The method is also able to obtain better solutions than the classical EM algorithm.
    Keywords: Gaussian mixtures, differential evolution, EM algorithm.
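    A minimal sketch of the Cholesky encoding described above: a chromosome stores the entries of the lower-triangular factor L, and the decoded matrix L Lᵀ is a valid covariance matrix for any real-valued genes, so mutation and crossover cannot produce infeasible solutions. The function names are illustrative.

```python
import numpy as np

def encode(cov):
    """Covariance matrix -> flat vector of its lower-triangular Cholesky factor."""
    L = np.linalg.cholesky(cov)
    return L[np.tril_indices_from(L)]

def decode(genes, d):
    """Flat gene vector -> valid covariance matrix, for any real-valued genes."""
    L = np.zeros((d, d))
    L[np.tril_indices(d)] = genes
    return L @ L.T

cov = np.array([[2.0, 0.6], [0.6, 1.0]])
genes = encode(cov)
mutated = genes + 0.1 * np.random.default_rng(0).standard_normal(genes.shape)
print(decode(mutated, 2))  # still symmetric positive semi-definite
```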

    Training Neural Networks with a Hybrid Algorithm Based on Differential Evolution

    A new hybrid method for feed-forward neural network training, which combines the differential evolution algorithm with a gradient-based approach, is proposed. In the method, after each generation of differential evolution, a number of iterations of the conjugate gradient optimization algorithm are applied to each new solution created by the mutation and crossover operators. The experimental results show that, in comparison to standard differential evolution, the hybrid algorithm converges faster. Although this convergence is slower than that of classical gradient-based methods, the hybrid algorithm is significantly better at avoiding local optima.
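    A minimal sketch of the hybrid scheme, assuming SciPy's conjugate-gradient optimizer stands in for the paper's gradient step: after each DE generation, every trial weight vector receives a few CG iterations before selection. The tiny 2-4-1 tanh network, the XOR data, and all constants are illustrative assumptions, not the paper's setup.

```python
import numpy as np
from scipy.optimize import minimize

# XOR as a toy training set for a 2-4-1 tanh network
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
y = np.array([0.0, 1.0, 1.0, 0.0])
H = 4
DIM = 2 * H + H + H + 1  # W1, b1, W2, b2 flattened into one vector

def loss(w):
    """Mean squared error of the network with flattened weights w."""
    W1 = w[:2 * H].reshape(2, H)
    b1 = w[2 * H:3 * H]
    W2 = w[3 * H:4 * H]
    b2 = w[-1]
    out = np.tanh(X @ W1 + b1) @ W2 + b2
    return np.mean((out - y) ** 2)

rng = np.random.default_rng(0)
pop = rng.normal(0.0, 1.0, (20, DIM))
fit = np.array([loss(w) for w in pop])
for _ in range(30):  # DE generations
    for i in range(len(pop)):
        others = [j for j in range(len(pop)) if j != i]
        a, b, c = pop[rng.choice(others, 3, replace=False)]
        trial = np.where(rng.random(DIM) < 0.9, a + 0.7 * (b - c), pop[i])
        # hybrid step: a handful of conjugate-gradient iterations per offspring
        res = minimize(loss, trial, method="CG", options={"maxiter": 5})
        if res.fun < fit[i]:
            pop[i], fit[i] = res.x, res.fun
print("best MSE:", fit.min())
```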

    Learning decision rules using a distributed evolutionary algorithm

    A new parallel method for learning decision rules from databases by using an evolutionary algorithm is proposed. We describe an implementation of the EDRL-MD system on a cluster of multiprocessor machines connected by Fast Ethernet. Our approach consists in distributing the learning set across the processors of the cluster. The evolutionary algorithm uses a master-slave model to compute the fitness function in parallel; the remainder of the evolutionary algorithm is executed on the master node. The experimental results show that for large datasets our approach is able to obtain a significant speed-up in comparison to a single-processor version.
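    A minimal sketch of the master-slave fitness step, assuming Python's multiprocessing stands in for the Fast Ethernet cluster: the learning set is split into shards, each worker computes a partial fitness for a candidate rule on its shard, and the master sums the parts while running the rest of the algorithm itself. The box-shaped rule encoding and all names are illustrative assumptions, not EDRL-MD's actual representation.

```python
import numpy as np
from multiprocessing import Pool

def partial_fitness(args):
    """Slave task: covered positives minus covered negatives on one data shard."""
    (lo, hi), Xs, ys = args
    covered = np.all((Xs >= lo) & (Xs <= hi), axis=1)
    return int(np.sum(ys[covered] == 1) - np.sum(ys[covered] == 0))

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    X = rng.random((10_000, 3))
    y = (X[:, 0] > 0.5).astype(int)
    # one shard per slave; on a real cluster each node would hold its shard locally
    shards = list(zip(np.array_split(X, 4), np.array_split(y, 4)))
    rule = (np.array([0.5, 0.0, 0.0]), np.array([1.0, 1.0, 1.0]))  # box rule
    with Pool(4) as pool:  # master-slave fitness evaluation
        parts = pool.map(partial_fitness, [(rule, Xs, ys) for Xs, ys in shards])
    print("fitness:", sum(parts))  # selection, mutation etc. stay on the master
```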

    Cost-Sensitive Decision Trees with Pre-pruning


    An Evolutionary Algorithm Using Multivariate Discretization for Decision Rule Induction

    We describe EDRL-MD, an evolutionary-algorithm-based system for learning decision rules from databases. The main novelty of our approach lies in its handling of continuous-valued attributes. Most decision rule learners use univariate discretization methods, which search for threshold values for one attribute at a time. In contrast, EDRL-MD simultaneously searches for threshold values for all continuous-valued attributes when inducing decision rules. We call this approach multivariate discretization. Since multivariate discretization is able to capture interdependencies between attributes, it may improve the accuracy of the obtained rules. The evolutionary algorithm uses problem-specific operators and variable-length chromosomes, which allows it to search for complete rulesets rather than single rules. Preliminary results of experiments on several real-life datasets are presented.
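    A minimal sketch of the multivariate-discretization idea: one chromosome carries threshold values for all continuous attributes at once, so the search adapts them jointly rather than discretizing each attribute in isolation. The simple (1+1)-style mutation loop below is an illustrative stand-in for EDRL-MD's evolutionary algorithm and variable-length rulesets.

```python
import numpy as np

rng = np.random.default_rng(0)
# two interdependent attributes: class 1 iff x0 + x1 > 1
X = rng.random((1000, 2))
y = (X.sum(axis=1) > 1.0).astype(int)

def accuracy(chrom):
    """Chromosome = (lo0, lo1, hi0, hi1); the rule predicts class 1 inside the box."""
    lo, hi = chrom[:2], chrom[2:]
    pred = np.all((X >= lo) & (X <= hi), axis=1).astype(int)
    return np.mean(pred == y)

best = rng.random(4)
for _ in range(2000):
    cand = best + rng.normal(0.0, 0.05, 4)  # mutate all thresholds at once
    if accuracy(cand) > accuracy(best):
        best = cand
print("accuracy:", round(accuracy(best), 3), "thresholds:", best.round(2))
```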